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Abstract 

There has been a substantial amount of research on the relationship be- 
tween hippocampal neurogenesis and behaviour over the past fifteen years, but 
the causal role that new neurons have on cognitive and affective behavioural 
tasks is still far from clear. This is partly due to the difficulty of manipu- 
lating levels of neurogenesis without inducing off-target effects, which might 
also influence behaviour. In addition, the analytical methods typically used do 
not directly test whether neurogenesis mediates the effect of an intervention 
on behaviour. Previous studies may have incorrectly attributed changes in be- 
havioural performance to neurogenesis because the role of known (or unknown) 
neurogenesis-independent mechanisms were not formally taken into considera- 
tion during the analysis. Causal models can tease apart complex causal rela- 
tionships and were used to demonstrate that the effect of exercise on pattern 
separation is via neurogenesis-independent mechanisms. Many studies in the 
neurogenesis literature would benefit from the use of statistical methods that 
can separate neurogenesis-dependent from neurogenesis-independent effects on 
behaviour. 

Keywords: Bayesian graphical model, Behaviour, Causal model, Hippocampus, 
Neurogenesis, Stem cells 
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1 Introduction 



One of the advantages of laboratory-based studies is that it is often possible to 
experimentally control important variables that might influence the system under 
investigation. This is a luxury that some fields of biology (e.g. ecology) and many 
social sciences do not have. Instead, they often have to measure these relevant 
variables and use more complex statistical models to understand cause-and-effect 
relationships. Occasionally, some aspect of the system is not under complete exper- 
imental control, and it is necessary to use analytical methods developed in other 
fields to address key research questions. Such methods are often necessary when 
relating adult hippocampal neurogenesis to behaviour because factors that influ- 
ence neurogenesis such as age, hormones, stress, physical activity, environmental 
enrichment, anti-depressants, disease status, etc. have off-target effects, and there- 
fore establishing a causal relationship between neurogenesis and behaviour is not 
straightforward. Methods have been developed to test whether hypothesised causal 
relationships such as A affects B, and B affects C (but A does not affect C directly) 
are consistent with the data (denoted as A — > B — > C). Here, B is said to mediate 
the effect of A on C, and thus A's effect on C is indirect. However, a direct A — >• C 
effect might exist that does not involve B. How would one distinguish between these 
two situations? Or can A have an effect on C both via B and directly? If so, what 
proportion of the effect is via B? Causal models can be used to address these types 
of questions, but are rarely employed in studies of adult hippocampal neurogenesis. 
This means that the analyses do not directly address the question that the study 
was designed to test, and incorrect conclusions can be reached regarding how A 
affects C (i.e. directly, via B, or both). 

In a recent article, Creer et al. [l| examined the relationship between neuroge- 
nesis and performance on a pattern separation task. This task had two outcomes 
that were measured: the number of trials required to reach a set criterion, and the 
number of reversals achieved after a fixed number of attempts. Levels of neuroge- 
nesis were manipulated by placing a running wheel in the cages of some animals 
(runners), while controls had no running wheel. They showed that there was a 
relationship between neurogenesis and behaviour for the reversal data, and a non- 
significant trend in the predicted direction for the trials-to-criterion data. They 
concluded that because exercise (1) increased levels of neurogenesis, and (2) led to 
better performance on the behavioural task, that neurogenesis might be important 
for this task. However, this conclusion does not follow from the analysis because 
the effect of exercise on other aspects of neural functioning have not been taken into 
consideration. In other words, they concluded that A (exercise) affects B (neuro- 
genesis), which affects C (behaviour). What about the direct A — > C effect, which 
subsumes all of the neurogenesis independent effects? Could this completely account 
for the observed results? 

Figure I shows two potential models of the causal relationships (indicated by 
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arrows) between the treatment, neurogenesis, and behaviour. In the first model, 
the treatment (exercise vs. control) influences neurogenesis, which in turn influ- 
ences behaviour (performance on a pattern separation task). In the second model, 
neurogenesis does not have a direct causal effect on behavioural performance. The 
treatment influences neurogenesis, and it influences behaviour directly (or more 
likely via some other variable that was not measured, such as changes in the elec- 
trophysiological properties of the cells, spine density, etc). These two models make 
different predictions about relationships that will be found in the data, and therefore 
these predictions can be tested against the data. The model that correctly predicts 
the data will have the greater support, and as will be shown below, the evidence is 
strongly in favour of the neurogenesis-independent model. In addition to the two 
different predictions, these models also make three predictions which in common, 
and therefore these three predictions cannot be used to discriminate between the 
models. Unfortunately, many published studies only test the common predictions, 
and therefore they cannot provide support for a causal role for neurogenesis. 



2 Methods 

Data were accurately extracted from Figure 2B and 3C of the original publication 
[H using g3data software (www.frantz.fi/software/g3data.php). This reanalysis 
focuses mainly on the reversal data, since it had a significant relationship with 
neurogenesis. Analysis was conducted with R (version 2.12.2; 0, 0]). In order to 
confirm the accuracy of the data extraction, the same analysis as Creer et al., was 
performed. R 2 between neurogenesis and behavioural performance was determined 
to be 0.235, which compares favourably with the results reported in the original 
publication (R 2 = 0.236; percent error < 0.5%). The data and R code are provided 
in the supplementary material so that the analysis is documented and reproducible. 

Two methods were used in the reanalysis of the data. The first is from Baron 
and Kenny 0], which can be implemented with standard software, does not require 
advanced statistical knowledge, and is even discussed in introductory statistics text- 
books [H] ■ More recent developments and caveats of using this approach are discussed 
in references 0- 14 ] . In this simple experiment (only three variables with linear rela- 



tionships) the relevant questions can be addressed with t-tests, linear regression, and 
an analysis of covariance (ANCOVA) — all standard techniques that will be familiar 
to many biologists. An ANCOVA model is characterised by a continuous response 
variable with a combination of continuous and categorical predictor variables. In 
this case, performance on the pattern separation task was the response variable, 
neurogenesis was the continuous predictor, and treatment (exercise vs. control) was 
the categorical factor. In the output of such an analysis, the effect of neurogenesis is 
adjusted for the effect of the treatment, and the effect of the treatment is adjusted 
for the effect of neurogenesis, and these results are conveniently the two predictions 
which differ between the models. These tests are all just specific examples of a 
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linear model, and all of the above tests can be conducted within a linear modelling 
framework, which is a more unified method for data analysis 

BEE EH- All of the 

results are reported as regression coefficients from linear models, but the R code 
in the supplementary material contains the equivalent results presented as t-tests, 
regression, and ANCOVA. The key relationships are described with the following 
equations, where T, N, and B represent the variables Treatment, Neurogenesis and 
Behaviour, respectively. 



N = ai + AT + ei (1) 

B = a2 + (3 2 T + e 2 (2) 

B = a 3 + fcN + e 3 (3) 

B = a 4 + p 4 T + /3 5 iV + e 4 (4) 

Equations 1-3 test model predictions 1-3 in Figure 1, and Equation 4 tests model 
predictions 4 and 5. The a parameters represent the intercept for each model and are 
not of direct interest. The (3 parameters are the coefficients for the effect in question 
and the subscript corresponds to the model prediction that is being tested. The e's 
are the error terms (residuals). In the context of this experiment, the main interest is 
in the relationship between neurogenesis and behaviour, once the effect of exercise on 
behaviour has been removed (i.e. testing whether /3s is different from zero). A related 
question is whether the effect of exercise is mediated via neurogenesis (for example, if 
the main research question is "By what mechanism does exercise affect behaviour"). 
This can be determined by requiring that (1) the treatment affects neurogenesis, 
and (2) neurogenesis is associated with behaviour after adjusting for a direct effect 
of treatment on behaviour (i.e. /3\ and (3§ both need to be significant). However, 
this piecemeal approach is not ideal because two statistical tests are performed and 
the strength of the relationship is not quantified. An alternative is the product-of- 
coefficients method where (3\ x /3s needs to be significantly different from zero. One 
difficulty with this approach is estimating the standard error, or uncertainty, of the 
estimate jfjl, 0ji however, this can be easily estimated in the Bayesian analysis below. 

The second method uses a Bayesian graphical model to estimate causal relation- 
ships. This is a general method, which can be scaled up to more complex experi- 
mental designs, provide more robust estimates with small samples sizes, and allows 
latent variables and prior information from previous experiments to be included. In 
addition, direct probability statements can be made about the parameters, and the 



results can provide support for the null hypothesis 17]; traditional null hypothesis 
significance testing can only reject (or fail to reject) the null hypothesis, but cannot 
directly provide support for it. Graphical models are an active area of research with 
developments occurring in many fields, and as often happens, similar methods are 
developed with different names (e.g. structural equation modelling, path analysis, 
Bayesian networks, probabilistic graphical models, causal mediation analysis), along 
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with slightly different philosophies, emphases, assumptions, and underlying algo- 



rithms [18|42|. A general introduction to probabilistic graphical models usi ng g ene 



expression and cell signalling as examples are provided by Needham et al. [25|, [26| 



and Friedman [271 ] , and a more detailed description of these methods using a simple 
three- variable problem can be found in Yuan and MacKinnon |23| . Briefly, the hy- 
pothesised relationships are written as directed acyclic graphs (DAGs; e.g. Fig. 1), 
where each variable is represented as a node, and hypothesised causal relationships 
are represented by arrows. These graphical representations are then converted into 
a set of equations (similar to Equations 1 and 4). The joint distribution of all of the 
variables are factored into a set of simpler conditional distributions, much like the 
number 12 can be factored into 3x2x2. Graphical models with a different arrange- 
ment of hypothesised causal relationships will be factored in different ways (e.g. 12 
can also be factored into 6x2 and 4x3), following a few simple rules. If a variable 
has no arrows pointing into it (i.e. it is not dependent on another variable), then it 
is written as a regular probability density or distribution function (e.g. P(X) is the 
probability density function of a continuous variable X). Variables which have an 
arrow pointing into them are condition on those variables. For example, if there is 
an arrow pointing from Y to X, this would be written as P(X\Y), which is read as 
the probability of X given Y (the "|" means "given"). Thus, the joint probability of 
the three variables P(T, N, B) can be rewritten as a set of conditional distributions, 
which correspond to the two graphical models (Fig. 1). For Model 1, this can be 
written as 

P(T, N, B) = P(B\N)P(N\T)P(T), (5) 
whereas Model 2 would be written as 



P(T, N, B) = P(B\T)P(N\T)P(T), (6) 

with the only difference being whether B is conditioned on N or T, reflecting the two 
potential variables that affect behaviour. It is then possible to test whether there is 
any evidence in the data for the hypothesised causal structures represented by these 
equations, and to quantify the degree to which one model is better compared to the 
other. 

Analysis was conducted with the R2WinBUGS R package [28] and OpenBUGS, 
and the code is provided in the supplementary material. Non-informative priors were 
used and the results were not sensitive to the form of the prior (e.g. normal versus 
uniform). Three chains with 500,000 iterations each were used for the Markov-chain 
Monte Carlo (MCMC) sampling, with a burn in period of 250,000 iterations, and 
every tenth value was saved. The three chains were well mixed (Gelman-Rubin 
statistic < 1.01 for all parameters). The main parameter of interest was the effect 
of neurogenesis on behaviour after adjusting for the indirect effect of exercise on 
behaviour (/3s). In addition, the direct (,$4) and indirect (/?i x /%) effect of exercise 
on behaviour was determined. 
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Many of the requirements for using causal models are the same as for a standard 
analysis, including a large enough sample size to obtain reasonably precise esti- 
mates, important variables have not been omitted, random assignment of animals 
to groups, and randomised collection of data, processing of samples, and quantifi- 
cation to avoid time-dependent confounders. Causal models have a few additional 
assumptions. First, the mediator (i.e. neurogenesis) should ideally be measured 
without error 0]. Creer et al. expressed the neurogenesis data as a density mea- 
surement (neurons/mm 3 ), and so the extent to which values represent changes in 
cell number (which is of interest) or changes in the volume of the dentate gyrus is 
not clear. It is assumed that the reported estimates are an accurate representation 
of the number of cells and are not influenced by changes in volume between con- 
ditions. However, such assumptions can be misleading and are unnecessary when 
using modern design-based stereological methods Second, there should be 

no neurogenesis by treatment interaction, which was reasonable to assume with this 
data (F(l,16) = 0.17, p = 0.687), although more recent methods can handle inter- 
actions [13j]. Third, the residuals ei and £4 should not be correlated, and levels of 
neurogenesis and £4 should not be correlated, as this might indicate that another 
variable not included in the analysis is affecting levels of neurogenesis and behaviour 
(both correlations were almost zero p > 0.999). Finally, it is assumed that the arrows 
are pointing in the right direction. This may be difficult to check in general, but 
because this was a randomised experiment, we can be certain that levels of neuroge- 
nesis and behavioural performance did not affect assignment to different treatment 
groups. 



3 Results 

3.1 Predictions common to both models 

The first prediction common to both models is that levels of neurogenesis will differ 
between the runners and control mice. This is indeed the case, with mice in the 
exercise condition having approximately twice as many new cells (Fig 2A, (3% = 3.23, 
95% CI = 2.03 to 4.43, p < 0.001). 

The second prediction common to both models is that performance on the be- 
havioural task will differ between the runners and control mice. The neurogenesis- 
dependent model predicts this because the treatment influences neurogenesis, and 
levels of neurogenesis in turn influence performance on the pattern separation task. 
The neurogenesis-independent model predicts this because exercise influences pat- 
tern separation directly (or more likely via some unmeasured variable). The run- 
ners had more than twice as many reversals compared to the control group (Fig 2B, 
j3 2 = 0.96 , 95% CI = 0.40 to 1.52, p = 0.002), which is consistent with both models. 

The third prediction common to both models is that there will be a relationship 
between neurogenesis and behaviour. The neurogenesis-dependent model predicts 
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this because neurogenesis has a causal influence on pattern separation. This is the 
main research question that the study was designed to test. The problem is that the 
neurogenesis-independent model makes the exact same prediction. This is because 
neurogenesis and behaviour are influenced by the same variable (the treatment) 
which induces a correlation between them. This is the classical third- variable prob- 
lem, where A and B are correlated, but only because they share a common cause in 
variable C. The relationship between neurogenesis and behaviour is significant (slope 
of the solid line in Fig 2B: (3 3 = 0.18, 95% CI = 0.02 to 0.34, p = 0.030), and Creer 
et al. took this as evidence that neurogenesis might be involved in the pattern sep- 
aration task. However, since the neurogenesis-independent model makes the same 
prediction, further analyses are required to separate the neurogenesis-dependent 
from the neurogenesis-independent effects. 

3.2 Predictions that discriminate between models 

If neurogenesis is causally involved in the pattern separation task, then blocking the 
increase in neurogenesis caused by exercise will cause the performance of the run- 
ners and controls to be the same (that is, the relationship between treatment and 
behaviour will no longer be significant). We could imagine doing this experimen- 
tally, where the mice would be given a compound whose sole effect is to nullify the 
effect of exercise on neurogenesis (indeed, this approach is often taken, and studies 
with this design are discussed below). Exercise would not affect behaviour in this 
case because the causal link between treatment and behaviour has been broken by 
holding neurogenesis at a constant level. The neurogenesis-independent model how- 
ever predicts that the relationship between treatment and behaviour will still exist 
after keeping the levels of neurogenesis constant. This is clear from Figure 1; in the 
neurogenesis-independent model there is a direct causal link between the treatment 
and behaviour, so whether levels of neurogenesis are held fixed or not is irrelevant. 
It is also possible to statistically hold constant, or fix, levels of neurogenesis at some 
value (also referred to as "adjusting for", "taking into account", "controlling for", or 
"conditioning on") , and when this is done we find that there is still a significant rela- 
tionship between treatment and behaviour (difference between runners and controls: 
/3 4 = 1.06 reversals, 95% CI = 0.09 to 2.02, p = 0.034), thus providing support for 
the neurogenesis-independent model. 

The other prediction that differs between the models (prediction five in Fig. 1) 
is that if the neurogenesis-independent model is correct, then conditioning on the 
treatment will remove the correlation between neurogenesis and behaviour. This is 
clear from Figure 1, where it can be seen that in the neurogenesis-independent model, 
the link between neurogenesis and behaviour will be broken if we condition on the 
treatment. This can be thought of as removing a common cause of neurogenesis and 
behaviour, and testing whether there is still a relationship between them. However, 
the neurogenesis-dependent model predicts that there will still be a relationship 
between neurogenesis and behaviour because there is a direct causal link between 
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the two. The results show that there is no relationship between neurogenesis and 
behaviour when conditioned on treatment (/% = -0.03, 95% CI = -0.27 to 0.21, p 
= 0.788, represented as the slope of the two dashed lines in Figure 2C), which also 
supports the neurogenesis-independent model. Note that this is not due to a lack of 
statistical power, the value is close to zero, and is even in the opposite direction to 
what the neurogenesis-dependent model predicts (i.e. higher levels of neurogenesis 
were associated with worse performance, although not significantly so). This model 
is actually fitting a separate regression line for the runners and controls, rather than 
one regression line through all of the data points as in Figure 2C. The relationship 
between neurogenesis and behaviour needs to hold within each group, and should 
not be driven by differences between the group means of the two variables. 

The above analysis examined individual parameters to see whether they were 
different from zero, which is useful to address particular questions about the data. 
It is also possible to examine a model as a whole, by calculating a measure of model 
fit or adequacy. Two models can be fit to the same data and then compared, and the 
model with the better fit is preferred. Models that are more complex (have more pa- 
rameters) have greater freedom to better approximate the data and therefore will fit 
better than less complex models. Therefore the complexity of the models must also 
be taken into account when performing a comparison (in this case the two models had 
the same number of parameters). A number of methods have been developed for this 
type of model comparison, and the deviance information criterion (DIC; (32|) is one 
appropriate option for the Bayesian analysis. The lower the DIC for a model, the bet- 
ter the fit, and the larger the difference in DICs between two models, the better one 
model is versus the other. There are no strict rules regarding how large a difference 
is considered important, however a difference in DIC between 5-10 can be considered 
substantial (http : / / www . mrc-bsu . cam . ac . uk/bugs/winbugs/dicpage . shtml) . DIC 
for the neurogenesis-dependent model was 116.3, while for the neurogenesis-independent 
model it was 110.9. The difference of 5.4 therefore provides strong support for the 
neurogenesis-independent model. 

The above analysis pitted two competing models against each other that hy- 
pothesised different mechanisms for the effect of exercise on behaviour. While 
this is useful for explanatory purposes, it is likely that multiple mechanisms are 
at work (both neurogenesis and neurogenesis-independent), and the main research 
question should be "to what extent does neurogenesis contribute to behaviour". In 
other words, it would be useful to determine how much of exercise's effect is due 
to neurogenesis, and how much is due to neurogenesis-independent mechanisms. 
The relationship between exercise and behaviour was therefore decomposed into the 
indirect/neurogenesis-dependent (Exercise — > Neurogenesis — > Behaviour) and 
direct/neurogenesis-independent (Exercise — > Behaviour) paths, which were then 
estimated. This can be represented as 

P(T, N, B) = P(B\T, N)P(N\T)P(T), (7) 
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where the first term now has B conditioned on both T and N (compare with Equa- 
tions 5 and 6). The results are displayed in Figure 2D, which show the posterior 
densities for the two effects. These density plots represent the estimated effects along 
with the uncertainty associated with them. For example, the direct effect (DE) of 
exercise via neurogenesis-independent mechanisms is to increase the number of re- 
versals by approximately one (the peak of the distribution), with a 95% credible 
interval of 0.09 to 2.03 reversals (a 95% credible interval means that there is a 95% 
chance that the true effect is between the upper and lower values, this is often what 
confidence intervals are mistakenly believed to be). Since the lower interval excludes 
zero, we can conclude that there are neurogenesis-independent mechanisms at work 
(the associated p- value is 0.016). However, the indirect effect (IE) of exercise via 
neurogenesis on behaviour was very close to zero, and was slightly negative. In other 
words, the "best guess" of the effect of increasing neurogenesis is that it worsens per- 
formance (which is the same result as the first analysis) . If the effect of neurogenesis 
is actually zero, a negative value would be expected 50% of the time, so this is not 
surprising. It is clear from these results that exercise is not affecting behaviour via 
changes in levels of neurogenesis. 

The results are the same for the trials-to-criterion data. Creer et al. established 
that the first two predictions are supported by the data. The third prediction (a 
relationship between neurogenesis and behaviour) was not significant (/?3 = -3.79, 
95% CI = -8.74 to 1.15, p = 0.125), but they described it as a "trend". However, this 
relationship between neurogenesis and behaviour is greatly attenuated (/3s = -0.16, 
95% CI = -8.37 to 8.04, p = 0.967) when conditioned on treatment, indicating that 
there is little support for the neurogenesis-dependent model. 

Creer et al., concluded that "Taken together, our findings indicate that exercise- 
induced neurogenesis improves dentate gyrus-mediated encoding of distinct spatial 
representations", and it seems that this conclusion has been accepted by others 
Mil]. However, a reanalysis of their data using causal models — which can tease 



apart complex relationships and directly test whether neurogenesis mediates the 
effects of exercise on behaviour — clearly shows that the effect of exercise is via 
neurogenesis-independent mechanism(s). This is further supported by their find- 
ing in a separate experiment where aged runners (the reanalysed experiment used 
younger adults) did not have an increase in neurogenesis but still showed a small 
improvement on a modified version of the pattern separation task, which is exactly 
what the neurogenesis-independent model predicts. It should also be noted that 
the adult mice ran 23.5 ±1.79 km/day while the aged mice only ran 5.4 ±0.68 
km/day. Therefore, because aged animals ran 77% less, the effect of running (via 
neurogenesis-independent mechanisms) will be predicted to influence behaviour to 
a lesser extent. While this manuscript was under review, this group has published 
a review article discussing other factors that may mediate the effects of exercise 
and environmental enrichment on behaviour 38]. We need to go further however, 
and convert these pictorial representations of hypothesised causal relationships into 
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statistical models which can be tested against the data. 



4 Discussion 

Causal models are rarely used to analyse data from laboratory-based biological ex- 
periments, which is partly due to the high experimental control that can usually be 
achieved in such situations (lack of knowledge of these methods is another reason) . 
However, it is not always possible to create two experimental groups that differ only 
on the variable of interest. For example, the runners are not the same as the controls 
in all respects (even though they may have been at baseline due to randomisation), 
with the only exception of having higher levels of neurogenesis. If these other vari- 
ables are potential candidates for mediating the effect of the treatment, then it is 
not possible to make causal statements about the variable of interest. One could 
imagine the exact same study (and analysis) carried out by a second group whose in- 



terest is not in neurogenesis, but in spine density (also affected by exercise 39, 4C 
which was measured instead. Another group would look at levels of synaptic pro- 
teins, while a fourth would focus on glutamate receptors (both of which are also 



affected by exercise [4l|, l42|). Without the appropriate analysis, each group would 
conclude that their hypothesised variable is important for pattern separation, and 
all four would only have an incomplete picture. It is possible that a particular vari- 
able has no effect, and thus the wrong conclusion would be reached. If more than 
one variable is important, than all of the estimated effects will be biased because 
the total effect (via all mechanisms) is being estimated, rather than the individual 
effects separately. 

While there was no evidence for a causal role of neurogenesis in the present 
experiment, this does not rule out a contribution of neurogenesis to other behavioural 
tasks, or for the effect of decreasing neurogenesis on pattern separation (43| . Indeed, 
the causal relationships are likely to be more complex than suggested by this analysis, 
since angiogenesis was also affected by the treatment, and this introduces another 



variable by which the treatment might affect behaviour [441 ] . 



4.1 Manipulating neurogenesis often affects other relevant vari- 
ables 

If the goal of a study is only to demonstrate the effect of exercise on behaviour, then 
the concerns discussed herein do not apply. However, problems arise if the interest 
is in the effect of neurogenesis on behaviour, and exercise (or some other treatment) 
is used as a means to manipulate neurogenesis. Most methods of manipulating 
neurogenesis also affect other variables that are known (or could reasonably be 
expected) to contribute to behaviour (Table 1). For example, exercise affects spine 
density in the entorhinal cortex, CA1, and dentate gyrus [33, as well as levels 



of synaptic proteins 4lJ, glutamate receptors [42j], and various other genes, growth 
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factors, and neurotransmitters in the dentate gyrus [45l . |46[. If other factors are 
known to be involved, it is not possible to address the question "what is the effect of 
neurogenesis on behaviour" simply by looking at the relationship between the two. 
This will lead to severely biased results if the other relevant factors are not taken 
into account, as was demonstrated in the present study. Once it is known that 
exercise affects other aspects of the brain which might provide alternative causal 
explanations, future studies must take them into account, or at least rule them out as 
a potential contributing factor in any particular study. In addition to experimental 
manipulation, some studies use groups that naturally vary in levels of neurogenesis 
such as young versus old, or a disease model versus control. Here, neurogenesis is 
not manipulated directly, but clearly there are many differences between old and 
young brains, or in transgenic/knock-out mice versus controls that need to be ruled 
out. Importantly, anything that affects physical activity or locomotor behaviour 
in the home cage (e.g. a disease model, general health, age) might then influence 
performance on a behavioural task. 

This reanalysis not only reverses the conclusions of the original study, but brings 
into question the conclusion of many published studies purporting to demonstrate 
evidence of an association between neurogenesis and behaviour. This is because 
the basic design, analysis, and logic of the Creer et al. study is similar to many 
studies in the literature: manipulate neurogenesis and observe behaviour; if there 
are differences in both neurogenesis and behaviour between groups, conclude (ten- 
tatively) that neurogenesis might be causally involved. Furthermore, ignore other 
potential variables that might explain the results. The details of the studies vary 
(e.g. method of manipulating neurogenesis, type of behavioural task, etc.), but 
they follow the same basic template. The onus is on those reporting associations 
between neurogenesis and behaviour to demonstrate that neurogenesis-independent 
mechanisms were not at work, or at least quantify their magnitude and estimate the 
unique contribution that neurogenesis makes. For many studies this has not been 
done, and there is good reason to believe that neurogenesis-independent effects play 
a role in these studies (see Table 1 for a brief list). It is possible (and indeed not 
difficult) to design studies with biased effects that can be consistently reproduced 
ji/?! ]. and it is likely that this has occurred in the neurogenesis literature. The pos- 
sibility that neurogenesis-independent effects might play a role is acknowledged in 
many studies and review articles (e.g. Creer et al. noted that exercise is known 
to affect expression of neurotrophins, vascularisation, dendritic spine density, and 
synaptic plasticity), but the functional consequences of this when analysing the data 
and interpreting the results are generally ignored. If these off-target effects are not 
under experimental control, then the only option is to use more complex statistical 
models to account for them. 

Many studies only establish that the first two predictions in Figure 1 hold, and 
some studies such as Creer et al. test the third prediction, but given that these 
results are also entirely consistent with a system in which the behavioural task is 
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completely independent of neurogenesis, these results cannot support the hypothe- 
sis that hippocampal neurogenesis influences behaviour. Therefore statements such 
as "Thus, the positive correlation between enhanced neurogenesis and improved 
learning. . . remains a valid basis for the suggestion that newly generated cells may 
be important for memory function." [48], and "Clearly, the results of the past 15 
years of research support the idea of a functional role of adult-generated neurons 
in learning and memory processes. . . " [4^ |. need to be reconsidered, as they have 
less empirical support than commonly supposed. It should be noted that these are 
not poorly designed or flawed studies; the analysis simply does not directly test the 
main research question, or take into account potential confounding variables, which 
can lead to incorrect conclusions. Recent review articles comment on the conflicting 
results in the literature 5(J- 54 ] , and often suggest that this is due to some interaction 
between the method of manipulating neurogenesis, the method of measuring neuro- 
genesis, the species or strain of animal used, differing demands of the behavioural 
task, the type of memory examined, etc. While these factors most likely play a 
role, conflicting results are not unexpected if important factors, often unmeasured, 
are influencing the results. It would be useful to reanalyse many of the key studies 
using causal models. These studies only estimated the total effect of their respective 
treatments on behaviour, and if any part of these effects were via neurogenesis- 
independent mechanisms, then the estimates were biased and attributed too much 
of the total effect to neurogenesis. If neurogenesis-independent effects were present, 
then separating the total effect into a direct (neurogenesis-independent) and indi- 
rect (neurogenesis-dependent) component would decrease the estimated influence 
of neurogenesis. Thus, studies that were initially statistically significant will have 
attenuated estimates and larger p- values. One can only speculate on the number 
of studies whose conclusions will have to be revised. To get a feeling for what this 
number might be, it is interesting to note that only five studies (including this re- 
analysis) [441.1 5 5145 71] used models that conditioned on important variables, and these 
have either found no association between neurogenesis and behaviour, or found that 
higher levels of neurogenesis were associated with worse behavioural performance. 

Recently, Sahay et al. published a very thorough paper which used genetic meth- 
ods to selectively increase the survival of new born neurons 581 ] . They found that 
animals in the high neurogenesis group were better able to discriminate between 
two similar contexts, suggesting improved pattern separation. However, perhaps 
the most interesting result was that there were no differences between the normal 
and high neurogenesis groups on the standard behavioural tests that numerous pre- 
vious studies showed to be sensitive to levels of neurogenesis, including the Morris 
water maze, novel object recognition, active place avoidance, forced swim test, and 
novelty-suppressed feeding. This suggests that previous studies might have been in- 
correctly attributing differences in behavioural performance to neurogenesis, while 
the actual effects were via neurogenesis-independent mechanisms. It is likely that 
genetic methods of manipulating hippocampal neurogenesis are "cleaner" in that 
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there are fewer off-target effects, however this is an assumption, and one which can 
be tested with the data at hand. In other words, whether neurogenesis-independent 
effects exist, and the magnitude of these effects, is an empirical question which can 
be easily addressed. 



4.2 More complex designs 

It is recognised that neurogenesis-independent factors can play a role, and some stud- 
ies explicitly state that they used two methods of decreasing neurogenesis to avoid 
off-target effects [5^. In addition, some studies (6ol-[6r3| go a step further and try to 
demonstrate causality by (1) increasing neurogenesis and showing this improves per- 
formance on a behavioural task, and then (2) inhibiting this increase in neurogenesis 
and showing that the behavioural improvement is lost. This is doing experimentally 
what the causal models do statistically, and can provide stronger evidence for a 
causal role of neurogenesis. These hypothesised causal relationships are shown with 
solid arrows in Figure 3. However, it is still possible for neurogenesis-independent 
mechanisms to completely explain such results. This could occur if the method 
of increasing neurogenesis directly affects behaviour (e.g. Exercise — > Behaviour 
dashed line in Figure 3), and the method of decreasing neurogenesis also directly 
affects behaviour (e.g. Corticosterone — > Behaviour dashed line), without neu- 
rogenesis playing a role (no Neurogenesis — > Behaviour arrow in Figure 3). For 
example, it is possible that exercise and corticosterone affect behaviour directly, 
and also affect neurogenesis as a by-product. The relationships that actually exist 
in the data can be tested, and there is no need to speculate about which model or 
interpretation of the data is correct. The use of causal models in such a situation 
provides a test of the assumption that neurogenesis-independent mechanisms are 
not operating, and if they are, their magnitude can be quantified. It should also 
be stressed that this requires no further experimentation, these relationships can 
be tested with the available data. The supplementary material contains the results 
of a simulation study where it is demonstrated that using the standard analytical 
methods, it is not possible to distinguish between a situation where the behavioural 
outcome is completely dependent on neurogenesis from a situation where neuroge- 
nesis plays no role. The causal modelling approach can clearly distinguish these 
two situations. In addition, these graphical representations are intuitive, make the 
hypothesised relationships explicit, highlight the implied assumptions (e.g. indepen- 
dence assumptions between variables), and can be converted into statistical models 
which can be used to make inferences. Furthermore, such model building is the way 



science advances [67|] 
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5 Conclusions 



Nakagawa and Hauber 68] suggested five statistical methods that should be used 
more often by neuroscientists (meta-analysis, mixed-effects modelling, multiple im- 
putation, model averaging, and Markov chain Monte Carlo), and to this list we could 
add a sixth "M": mediation analysis. One reason for the extensive research on hip- 
pocampal neurogenesis is that it is widely believed to play a role in some cognitive 
and affective behaviours. If this is true, then manipulating levels of neurogenesis 
is a logical approach for improving memory and treating depression. However, if 
neurogenesis does not have a causal role on behaviour, or if the role of neurogenesis 
is small compared to the neurogenesis-independent mechanisms of a treatment, then 
resources would be better spent elsewhere. It is necessary to establish whether ob- 
served relationships are causal, as well as their magnitude, especially if the ultimate 
aim is to manipulate the system for therapeutic ends. Simple methods exist and are 
routinely used in other fields, and there is no reason for their continued omission in 
the neurogenesis literature. 
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Table 1: A selection of off-target effects for commonly used methods of manipulating 
neurogenesis. Due to these off-target effects, it is not possible to make causal claims 
regarding the role that neurogenesis has on behaviour by simply examining the 
relationship between neurogenesis and behaviour — more complex models are needed. 



Affects neurogenesis 


Off-target effects 


References 


Stress / Corticosterone 


Hippocampal volume 


[69] 




Dendrites / spines 


[69, 70] 


Exercise 


Spine density 


[39,40] 




Synaptic proteins 


r a -t i 

[41] 




Glutamate receptors 


[42] 




Plasticity genes 


[45] 




Growth factors 


[46] 


Environmental Enrichment 


Dendrites / spines 


[71-73] 




BDNF 


[74J 


MAM* 


General health 


[75] 




Locomotor activity 


[75] 


Imipramine 


Dendrites / synapses 


[69, 76, 77J 


Fluoxetine 


Dendrites / spines 


[69, 78] 


Irradiation 


NMDA receptors 


[29] 




Protein expression 


[80, 81] 




Inflammation /vasculature 


[82-84] 




DNA damage 


[85] 



*Methylazoxymethanol Acetate 
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Neurogenesis-dependent 
model (Ml) 



Neurogenesis-independent 
model (M2) 



Treatment 
(Exercise vs. Control) 



I 



Treatment 
(Exercise vs. Control) 



Neurogenesis 



I 



Neurogenesis 



Behaviour 
(Pattern Separation) 



Behaviour 
(Pattern Separation) 



Model predictions: 



Ml M2 



2. Behaviour differs between treatment groups (B2) 



4. Conditioned on neurogenesis, exercise & behaviour are independent (B4) 



✓ 4 
✓ 



Figure 1: Two hypothetical causal models that explain the relationship between 
neurogenesis and behaviour. In the neurogenesis-dependent model, an experimen- 
tal treatment affects the levels of neurogenesis, and altered levels of neurogenesis 
affect behavioural performance on a pattern separation task. In the neurogenesis- 
independent model, the experimental treatment affects performance on the be- 
havioural task independently of neurogenesis. These models make three predictions 
in common and two predictions which differ, and the latter can be tested against 
the data to provide support for one model versus the other. Arrows indicate hy- 
pothesised causal connections. /3's correspond to the coefficients that are tested in 
Equations 1-4. 
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Figure 2: Neurogenesis and behavioural data. Runners had significantly more new 
cells p < 0.001; A), and more reversals (/?2, P = 0.002; B) compared to the 
control group. Most papers in the literature that used a similar design stop the 
analysis here, incorrectly believing that this provides evidence that neurogenesis af- 
fects behaviour. There was also a significant correlation between the two outcomes 
(/?3, solid line, p = 0.030, C), but this is still insufficient to indicate that neurogenesis 
affects behaviour. The dashed lines show the (lack of a) relationship between neu- 
rogenesis and behaviour conditioned on treatment group (/?s). Posterior densities 
from the Bayesian analysis (D) show that neurogenesis had no effect on the number 
of reversals conditioned on treatment group (/3s, dashed line, corresponds to dashed 
lines in Panel C). Decomposing the total effect of exercise into the indirect effect 
(IE; via neurogenesis, f3\ x /3 5 ) and the direct effect (DE; neurogenesis- independent, 
^4) shows that neurogenesis-independent mechanisms are affecting behaviour (p = 
0.016). The peak of a distribution is tl^j^est estimate of the effect, and the width 
corresponds to the uncertainty in this estimate. 



Stimulant 



Exercise 
Enrichment 
Antidepressant 



I 



Neurogenesis 



I 



Behaviour 



Irradiation 
Corticosterone 
Anti-mitotic 
Genetic knock-down 



Figure 3: A more complex causal design. Some studies attempt to establish a causal 
role for neurogenesis by blocking the increase induced by exercise or environmental 
enrichment using irradiation, anti-mitotic agents, or genetically modified mice. The 
logic is that if neurogenesis is causally involved in behaviour, then animals in the 
stimulated+inhibited group (i.e. effects of stimulation are reduced or completely 
blocked) will have reduced behavioural performance compared to animals in the 
stimulated group. These causal effects are shown with the black solid arrows, with 
Neurogenesis — > Behaviour being the main effect of interest. However, this same 
relationship could be seen if the Neurogenesis — > Behaviour effect is zero, but 
the method of stimulating neurogenesis affects behaviour directly (Stimulant — > 
Behaviour) and the method of knocking down neurogenesis also negatively affects 
behaviour directly (Inhibitor — > Behaviour; dashed lines). The important point 
is that both the initial hypothesised causal relationships as well as the alternatives 
can be tested against the data. The supplementary material contains a further 
discussion. 
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Supplementary material 



An examination of "causal" experiments 

As mentioned in the main text (Section 4.2 and Figure 3), one method of attempting 
to establish causality is to manipulate neurogenesis in two ways, first by increasing 
neurogenesis and looking for a concomitant improvement in behaviour, and then by 
inhibiting this increase and looking for attenuated improvement on the behavioural 
task. Such studies can provide much stronger evidence that neurogenesis has a causal 
influence on behaviour. However, it is still possible that the experimental manipu- 
lations affect behaviour completely through neurogenesis- independent mechanisms. 
Moreover, it is not possible to establish to what extent neurogenesis is involved (if at 
all) using standard methods of analysis. Simulated data were used to demonstrate 
this important point. This is required because the causal relationships amongst 
the variables are known, having been generated by the experimenter. One can 
then check whether a particular analysis comes to the correct conclusion. Supple- 
mentary Figure 1 shows two causal models from a hypothesised experiment (one 
neurogenesis-dependent and the other neurogenesis- independent). These graphical 
models are related to Figure 3 in the main text. 



Neurogenesis-dependent 



Neurogenesis-independent 



Exercise 



Corticosterone 



Neurogenesis 



Behaviour 




Corticosterone 



Neurogenesis 




Figure 4: Data were generated from the neurogenesis-independent model and anal- 
ysed with both standard methods and the causal modelling approach. 



To make the example concrete, exercise will be the treatment that increases 
neurogenesis and corticosterone will be the treatment that decreases neurogene- 
sis. The data were generated such that the values and results are similar to what 
might be found in the literature. Let E, C, N, and B represent the variables 
Exercise, Corticosterone, Neurogenesis and Behaviour, respectively. Data for the 
neurogenesis-independent model can be generated using the following equations 
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N = ai +71^ + 72(7 + 61 
B = q 2 + 71 E + 72C + e 2 . 



(8) 
(9) 



The a parameters are not of interest and just determine the overall mean value of 
the data. 71 is the effect of exercise, which effects both neurogenesis and behaviour 
by ten units. 72 is the effect of corticosterone, which decreases both neurogenesis 
and behaviour by ten units (the effect of corticosterone is equal and opposite to the 
effect of exercise and thus they completely cancel each other) . The 7 parameters are 
known constants for the effect in question and are analogous to the /3 parameters 
in the main text (which were unknown and estimated from the data). The e's 
are the residuals and were drawn from a normal distribution with a mean of zero 
and a standard deviation of five. The sample size was forty, with ten animals in 
each condition. Levels of neurogenesis and performance on the behavioural task are 
in arbitrary units. The following parameter values were used: ol\ = 25, a 2 = 0, 
71 = 10, 72 = -10, ei and e 2 - Af(0, 5 2 ). 

For comparison, data from the neurogenesis-dependent model were also gener- 
ated with the following equations 



The same parameter values as above were used, and 73 is the effect of neurogenesis 
on behaviour, where a one unit increase in neurogenesis is associated with a one 
unit improvement in behavioural performance. Note that Equations f and 3 are 
identical, since neurogenesis is affected by exercise and corticosterone in both models. 
Equations 2 and 4 differ depending on the model; in Equation 4, behaviour is only 
affected by neurogenesis, whereas in Equation 2, behaviour affected by exercise and 
corticosterone, reflecting the causal relationships shown in Supplementary Figure 1. 
Supplementary Figure 2 plots the generated data from these two models using a 
bar graph with error bars representing ±1 standard error of the mean (SEM) — the 
standard method of plotting such data, along with a scatterplot of neurogenesis vs. 
behaviour. 

A remarkable feature is that the data from the neurogenesis-dependent and 
neurogenesis-independent models are indistinguishable. The main question is whether 
the standard analysis of the neurogenesis-independent data will be able to conclude 
that neurogenesis is not important for behaviour. 

Standard analysis of the neurogenesis-independent model data 

This data can be analysed with a 2-way ANOVA, with Exercise and Corticosterone 
as factors (which is also another example of a linear model) , and followed by posthoc 



N = ai + 7i£ + 72C + ei 
B = a 2 + 73A" + £2 



(10) 
(11) 
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Figure 5: Comparison of data simulated from a neurogenesis-dependent and 
neurogenesis-independent model. These graphs and standard analyses are not able 
to distinguish between these models. Bars represent means ±1 SEM. Cort = Cor- 
ticosterone 
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tests (Tukey's HSD), where all possible comparisons are done. This is not the recom- 
mended analysis, but it is commonly used, familiar, and thus useful for pedagogical 
purposes. A typical analysis and interpretation might proceed as follows: 

Exercise increased neurogenesis (Control/No Cort vs. Exer- 
cise/No Cort, p<0.001), which was blocked by the addition of 
corticosterone (Exercise/No Cort vs. Exercise/Cort, p<0.001). In 
addition, performance on the behavioural task was significantly 
improved by exercise (Control/No Cort vs. Exercise/No Cort, 
p<0.001), and this effect was blocked by corticosterone treatment 
(Exercise/No Cort vs. Exercise/Cort, p<0.001). Furthermore, 
there was a strong correlation between neurogenesis and behaviour 
(Pearson correlation = 0.731, p<0.001). 

Exercise increased neurogenesis and improved behavioural perfor- 
mance. By blocking the exercise-induced increase in neurogenesis 
with corticosterone, we were able to show that behavioural per- 
formance did not improve. Thus these results provide strong ev- 
idence that neurogenesis is the critical factor mediating the effect 
of exercise on behaviour. This is further supported by the strong 
neurogenesis-behaviour correlation. 

Recall that this analysis was based on data that was specifically created such 
that neurogenesis was not directly associated with behaviour (N does not appear in 
Equation 2). This correlation was induced because neurogenesis and behaviour share 
a common cause in both exercise and corticosterone. Thus, neither the group com- 
parisons nor the neurogenesis-behaviour correlation can provide evidence of causal- 
ity. Unfortunately, few analyses in the neurogenesis literature proceed further. If 
the data were actually generated from the neurogenesis-dependent model, then the 
above conclusions would be correct, but this would be based on luck and not a logical 
consequence of the analysis. In a real experiment, it is possible that the treatments 
affect behaviour partly through neurogenesis and partly through other mechanisms. 
If this is the case, the above analysis is not able to determine the extent to which 
neurogenesis is involved, which could range from to 100 percent. If the aim of a 
study is to determine the role of neurogenesis on behaviour, then not being able to 
estimate how much of the observed effect is due to neurogenesis is a serious limi- 
tation of the analysis. If the relevance of neurogenesis to a particular behavioural 
task could be anything, the reported results have not increased our understanding. 

A causal modelling approach 

Both models in Supplementary Figure 1 were fit to the same data as the above 
analysis (i.e. data for which neurogenesis was not influencing behaviour). The 
difference in DIC between models was 17.1 in favour of the neurogenesis-independent 
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model, which is overwhelming support this model. Furthermore, when a model was 
fit containing all hypothesised relationships (i.e. all the arrows in Figure 3 in the 
main text), the Neurogenesis — > Behaviour link was not significant (73 = 0.09, 
95% CI = -0.16 to 0.35, p = 0.370), and thus the correct conclusion was reached 
that there is little evidence for a causal Neurogenesis — > Behaviour relationship. 

Conclusion 

Standard methods of analysis cannot distinguish between a situation where the only 
influence on behaviour is neurogenesis from a situation where the effect of neuro- 
genesis is zero. The only question is how likely is it that neurogenesis-independent 
mechanisms are operating in any given study. In the main manuscript it is argued 
that there are known off-target effects for the commonly used methods of manipulat- 
ing neurogenesis. As new methods develop, there will likely be better experimental 
control of the system. However, there still exists a large body of literature that is 
difficult to interpret. 

Appendix 1: Data and R code 

## R code for Lazic SE (2011) . Using causal models to distinguish 
## between neurogenesis-dependent and -independent effects on 
## behaviour (main text) 



# Read in reversal and neurogenesis data 

d <- data. frame (reversals=c(0, 0.512, 0.512, 0.512, 0.512, 0.512, 
0.512, 1.488, 1.488, 0.512, 1, 1, 1.488, 1.488, 
1.488, 2, 2, 2, 1.976, 2.976) , 

neurogen=c (2 . 014 , 

4.892, 3.597, 2.95, 2.806, 2.23, 2.086, 2.95, 1.223, 
5.468, 9.424, 5.036, 5.827, 4.82, 4.46, 4.892, 
5.612, 6.691, 7.194, 6.331), 

condition=rep(0: 1, times=c(9, 11)) # = control, 1 = runner 



# replicate previous results (calculate R-square) 
cor .test (~reversals+neurogen, data=d) $estimate~2 
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# prediction 1: neurogenesis differs by condition 
LM1 <- lm(neurogen~condition, data=d) # Equation 1 
summary (LM1) # prints results 

confint(LMl) # calculate 95% CI 

# identical analysis using a t-test, only the sign of the t-statistic 

# is reversed (which is arbitrary) 

t .test (neurogen~condition, data=d, var . equal=TRUE) 



# prediction 2: behaviour differs by condition 

LM2 <- lm(reversals~condition, data=d) # Equation 2 
summary (LM2) 
conf int(LM2) 

# identical analysis using a t-test 

t .test (reversals~condition, data=d, var . equal=TRUE) 



# prediction 3: relationship between neurogenesis and behaviour 
LM3 <- lm(reversals~neurogen, data=d) # Equation 3 (regression) 
summary (LM3) 
conf int(LM3) 



# predictions 4 and 5 (different between models) 

LM4 <- lm(reversals~neurogen+condition, data=d) # Equation 4 

summary (LM4) 

conf int(LM4) 



# 

# Check assumptions 

# 1) test for homogeneity of regression (i.e there should be no 

# interaction between neurogenesis and condition) 
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LM5 <- lm(reversals~neurogen*condition, data=d) 
summary (LM5) 

# Interaction effect is not significant (p = 0.69) 



# 2) Residuals from LM1 and LM4 should not be correlated 
cor.test(resid(LMl) , resid(LM4)) # p = 0.999 

# 3) Neurogenesis should not be correlated with residuals from LM4 
cor.test(d$neurogen, resid(LM4)) # p = 0.999 



# 

# Bayesian analysis 

library (R2WinBUGS) 

# store data as a list 
data <- list(n=20, 

neurogen = d$neurogen, 
behave = d$reversals, 
cond = d$condition 
) 



# specify the model 
model <- function(){ 

# cond[i] = runner/control 

# neurogen [i] = neurogenesis 

# behave [i] = behaviour (reversal data) 

for(i in l:n) 
{ 

# effect of exercise on neurogenesis (Bl) . Corresponds to Eq. 1 in the 

# manuscript 

neurogen[i] ~ dnorm (mean. neurogen [i] , s2. neurogen) 
mean. neurogen [i] <- al + Bl*cond[i] 
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# effect of neurogenesis on behaviour adjusted for treatment (B5) and 

# effect of treatment on behaviour adjusted for neurogenesis (B4) 

# Corresponds to Eq. 4 in the manuscript 
behave [i] ~ dnorm (mean. behave [i] , s2. behave) 
mean. behave [i] <- a2 + B4*cond[i] + B5*neurogen [i] 

} 

# priors 

al ~ dnorm(0, 1.0E-5) 
a2 ~ dnorm(0, 1.0E-5) 
Bl ~ dnorm(0, 1.0E-5) 
B4 ~ dnorm(0, 1.0E-5) 
B5 ~ dnorm(0, 1.0E-5) 

s2.neurogen ~ dgamma(0 . 001 , 0.001) 
s2. behave ~ dgamma(0 . 001 , 0.001) 

# mediated effect 
med.eff <- B1*B5 

} 



# save model to external file 
write .model (model , "model . txt" ) 

# Parameters to save 

parms <- c("al" , "a2" , "Bl" , "B4" , "B5" , "s2 . neur ogen" , " s2 . behave " , "med . ef f " ) 

# initial values for MCMC sampling 
inits <- list( 

list (al=rnorm(l) , a2=rnorm(l) , Bl=rnorm(l) , B4=rnorm(l , , . 5) , 

B5=rnorm(l) , s2 .neurogen=rnorm(l , 1 ,0 . 05) , s2 .behave=rnorm(l , 1 ,0 . 05) ) , 
list (al=rnorm(l) , a2=rnorm(l) , Bl=rnorm(l) , B4=rnorm(l , , . 5) , 

B5=rnorm(l) , s2 .neurogen=rnorm(l , 1 ,0 . 05) , s2 .behave=rnorm(l , 1 ,0 . 05) ) , 
list (al=rnorm(l) , a2=rnorm(l) , Bl=rnorm(l) , B4=rnorm(l , , . 5) , 

B5=rnorm(l) , s2 .neurogen=rnorm(l , 1 ,0 . 05) , s2 .behave=rnorm(l , 1 ,0 . 05) ) 

) 
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# run analysis 

ml <- openbugs (model. f ile="model.txt" , data=data, inits=inits, 
parameters . to . save=parms , 
n.chains=3, n.iter = 500000, n.thin=10) 

# print results 
print (ml) 
ml$summary 



# 






mean 




sd 




2.5% 




25% 




50% 




75% 


97 . 5% 


Rhat 


n.eff 


# 


al 


2 


.7462 


0. 


.4485 


1 


,8588 


2 


.4554 


2. 


.7466 


3. 


.0383 


3 


.6265 


1 


.001 


75000 


# 


a2 


0. 


.7568 


0. 


.3964 


-0, 


,0224 


0, 


.4990 


0, 


.7557 


1 


.0127 


1 


.5396 


1 


.001 


75000 


# 


Bl 


3. 


.2312 


0. 


,6026 


2, 


.0410 


2, 


.8374 


3. 


.2314 


3, 


.6211 


4 


.4266 


1 


.001 


75000 


# 


B4 


1. 


.0576 


0. 


.4874 


0, 


,1012 


0, 


.7415 


1, 


.0561 


1. 


.3720 


2 


.0368 


1 


.001 


75000 


# 


B5 


-0 


.0309 


0. 


,1206 


-0, 


,2704 


-0, 


.1089 


-0, 


.0305 


0. 


.0476 





.2064 


1 


.001 


69000 


# 


s2 .neurogen 


0, 


.6188 


0. 


,2069 


0, 


.2815 





.4698 


0, 


.5954 


0, 


.7438 


1 


.0839 


1 


.001 


75000 


# 


s2 .behave 


2. 


.6714 


0. 


.9187 


1 


,1892 


2 


.0110 


2. 


.5675 


3, 


.2178 


4 


.7610 


1 


.001 


75000 


# 


med. ef f 


-0 


.0995 


0. 


.3966 


-0, 


.9028 


-0, 


.3437 


-0, 


.0947 


0. 


.1494 





.6833 


1 


.001 


70000 


# 


deviance 


105 


.7760 


4. 


,1243 


99 


.9155 


102 


.7592 


105 


.0595 


108 


.0029 


115 


.7389 


1 


.001 


75000 



# probablility direct effect (p = 0.016) 
sum(ml$sims.list$B4 < ) / length(ml$sims . list$B4) 

# note that the probability of the direct effect is actually 1 - 0.016 = 0.984, 

# meaning that there is a high probability of the direct effect. 1 - 0.984 = 0.016 

# was reported instead to keep with the convention of a small p-value indicating 

# the presence of a "significant" effect. 

# probability indirect effect (neurogenesis; p = 0.606) 
sum(ml$sims . list$med. ef f < ) / length(ml$sims.list$med.eff ) 
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